Search CORE

7 research outputs found

Optimal Coded Diffraction Patterns for Practical Phase Retrieval

Author: Ling Bingo Wing-Kuen
Lun Daniel Pak-Kong
Wang Li-Wen
Ye Qiuliang
Publication venue
Publication date: 28/03/2023
Field of study

Phase retrieval, a long-established challenge for recovering a complex-valued signal from its Fourier intensity measurements, has attracted significant interest because of its far-flung applications in optical imaging. To enhance accuracy, researchers introduce extra constraints to the measuring procedure by including a random aperture mask in the optical path that randomly modulates the light projected on the target object and gives the coded diffraction patterns (CDP). It is known that random masks are non-bandlimited and can lead to considerable high-frequency components in the Fourier intensity measurements. These high-frequency components can be beyond the Nyquist frequency of the optical system and are thus ignored by the phase retrieval optimization algorithms, resulting in degraded reconstruction performances. Recently, our team developed a binary green noise masking scheme that can significantly reduce the high-frequency components in the measurement. However, the scheme cannot be extended to generate multiple-level aperture masks. This paper proposes a two-stage optimization algorithm to generate multi-level random masks named

\textit{OptMask}

that can also significantly reduce high-frequency components in the measurements but achieve higher accuracy than the binary masking scheme. Extensive experiments on a practical optical platform were conducted. The results demonstrate the superiority and practicality of the proposed

\textit{OptMask}

over the existing masking schemes for CDP phase retrieval

arXiv.org e-Print Archive

Regularity scalable image coding based on wavelet singularity detection

Author: BINGO WING-KUEN LING
Canny J.
CHARLOTTE YUK-FAN HO
DANIEL PAK-KONG LUN
PETER KWONG-SHUN TAM
TAI-CHIU HSUNG
WAN-CHI SIU
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2008
Field of study

In this paper, we propose an adaptive algorithm for scalable wavelet image coding, which is based on the general feature, the regularity, of images. In pattern recognition or computer vision, regularity of images is estimated from the oriented wavelet coefficients and quantified by the Lipschitz exponents. To estimate the Lipschitz exponents, evaluating the interscale evolution of the wavelet transform modulus sum (WTMS) over the directional cone of influence was proven to be a better approach than tracing the wavelet transform modulus maxima (WTMM). This is because the irregular sampling nature of the WTMM complicates the reconstruction process. Moreover, examples were found to show that the WTMM representation cannot uniquely characterize a signal. It implies that the reconstruction of signal from its WTMM may not be consistently stable. Furthermore, the WTMM approach requires much more computational effort. Therefore, we use the WTMS approach to estimate the regularity of images from the separable wavelet transformed coefficients. Since we do not concern about the localization issue, we allow the decimation to occur when we evaluate the interscale evolution. After the regularity is estimated, this information is utilized in our proposed adaptive regularity scalable wavelet image coding algorithm. This algorithm can be simply embedded into any wavelet image coders, so it is compatible with the existing scalable coding techniques, such as the resolution scalable and signal-to-noise ratio (SNR) scalable coding techniques, without changing the bitstream format, but provides more scalable levels with higher peak signal-to-noise ratios (PSNRs) and lower bit rates. In comparison to the other feature-based wavelet scalable coding algorithms, the proposed algorithm outperforms them in terms of visual perception, computational complexity and coding efficienc

University of Lincoln Institutional Repository

The Hong Kong Polytechnic University Pao Yue-kong Library

Crossref

PolyU Institutional Repository

Improved MPEG-4 still texture image coding under noisy environment

Author: Daniel Pak-Kong Lun
T.C.L. Chan
Tai-Chiu Hsung
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

A total-internal-reflection-based Fabry–Pérot resonator for ultra-sensitive wideband ultrasound and photoacoustic applications

Author: Daniel Pak-Kong Lun
Mengqi Shen
Michael G. Somekh
Wen Chen
Xiaoping Jiang
Publication venue: 'Elsevier BV'
Publication date: 01/04/2023
Field of study

In photoacoustic and ultrasound imaging, optical transducers offer a unique potential to provide higher responsivity, wider bandwidths, and greatly reduced electrical and acoustic impedance mismatch when compared with piezoelectric transducers. In this paper, we propose a total-internal-reflection-based Fabry–Pérot resonator composed of a 12-nm-thick gold layer and a dielectric resonant cavity. The resonator uses the same Kretschmann configuration as surface plasmon resonators (SPR). The resonators were analyzed both theoretically and experimentally. The experimental results were compared with those for an SPR for benchmarking. The 1.9-μm-thick-PMMA- and 3.4-μm-thick-PDMS-based resonators demonstrated responsivities of 3.6- and 30-fold improvements compared with the SPR, respectively. The measured bandwidths for the PMMA, PDMS devices are 110 MHz and 75 MHz, respectively. Single-shot sensitivity of 160 Pa is obtained for the PDMS device. The results indicate that, with the proposed resonator in imaging applications, sensitivity and the signal-to-noise ratio can be improved significantly without compromising the bandwidth

Directory of Open Access Journals

PCB Soldering Defect Inspection Using Multitask Learning under Low Data Regimes

Author: Daniel Pak-Kong Lun
Huu-Thanh Nguyen
Sik-Ho Tsang
Tom Tak-Lam Chan
Zhaoqing Suo
Publication venue: Wiley
Publication date: 01/12/2023
Field of study

To increase the reliability of the printed circuit board (PCB) manufacturing process, automated optical inspection is often employed for soldering defect detection. However, traditional approaches built on handcrafted features, predefined rules, or thresholds are often susceptible to the variation of the acquired images’ quality and give unstable performances. To solve this problem, a deep learning‐based soldering defect detection method is developed in this article. Like many real‐life deep learning applications, the number of available training samples is often limited. This creates a challenging low‐data scenario, as deep learning typically requires massive data to perform well. To address this issue, a multitask learning model is proposed, namely, PCBMTL, that can simultaneously learn the classification and segmentation tasks under low‐data regimes. By acquiring the segmentation knowledge, classification performance is substantially improved with few samples. To facilitate the study, a soldering defect image dataset, namely, PCBSPDefect, is built. It focuses on the dual in‐line packages (DIP) at the PCB back side, DIP at the PCB front side, and flat flexible cables. Experimental results show that the proposed PCBMTL outperforms the best existing approaches by over 5–17% of average accuracy for different datasets

Directory of Open Access Journals

DeepGIN: Deep Generative Inpainting Network for Extreme Image Inpainting

Author: Li Chu-Tak
Liu Zhi-Song
Lun Daniel Pak-Kong
Siu Wan-Chi
Wang Li-Wen
Publication venue: HAL CCSD
Publication date: 17/08/2020
Field of study

International audienceThe degree of difficulty in image inpainting depends on the types and sizes of the missing parts. Existing image inpainting approaches usually encounter difficulties in completing the missing parts in the wild with pleasing visual and contextual results as they are trained for either dealing with one specific type of missing patterns (mask) or unilaterally assuming the shapes and/or sizes of the masked areas. We propose a deep generative inpainting network, named DeepGIN, to handle various types of masked images. We design a Spatial Pyramid Dilation (SPD) ResNet block to enable the use of distant features for reconstruction. We also employ Multi-Scale Self-Attention (MSSA) mechanism and Back Projection (BP) technique to enhance our inpainting results. Our Deep-GIN outperforms the state-of-the-art approaches generally, including two publicly available datasets (FFHQ and Oxford Buildings), both quantitatively and qualitatively. We also demonstrate that our model is capable of completing masked images in the wild

arXiv.org e-Print Archive

HAL-Polytechnique

Wavelet based speech presence probability estimator for speech enhancement

Author: Alam
Cohen
Cohen
Daniel Pak-Kong Lun
Daubechies
Davis
Dominic K.C. Ho
Donoho
Donoho
Ephraim
Ephraim
Ephraim
Gerkmann
Hu
Huber
Jansen
Kim
Kum
Leporini
Loizou
Lu
Moulin
Nowak
Percival
Picklands
Sendur
Shao
Soon
Soon
Stein
Tai-Chiu Hsung
Tak-Wai Shen
Tasmaz
Uderea
Udrea
Varga
Vizireanu
Walden
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref